Extending LOGS ontologizer to multi-language environments

Abstract

Ontologies greatly enhance our ability to machine-process digital documents and aid in knowledge engineering. They also augment search algorithms. LOGS (Lightweight universal Ontology Generation and exploitation architectureS) and its sample application, the Eagle, took a step toward automatic ontology generation through a fast, lightweight approach. Sanskritology and Indology are interesting domains in which most modern research is carried out in the West, often by transliterating Sanskrit texts into Roman script, with many scholars relying on definitive translations of these texts for their study. However, as most of the Scriptural texts were preserved by oral tradition, through the patrilineal system, a number of recensions exist for various texts, some complete and some not; only a few of these were ever translated. In addition, a number of these books repeat portions of each other. Texts can be presented in native or exploded forms. The study itself is highly interpretive. Therefore, the validation, ontologizing, and rendition through intelligent interfaces of the various recensions of a particular book present a very rich problem for methods such as LOGS. We describe the use of LOGS in an ontological server that generates and maintains ontologies of Sanskrit texts, and demonstrate its use on versions of the Sama Veda, one of the complete Hindu Scriptures. To do this, we need to process natural language in Sanskrit. To show how we accomplish this NLP, we describe the heuristic algorithms of the Vyakarana API we have developed, which is the first known adaptive Sanskrit grammar and transliteration tool for this area. As its grammar evolved with Sanskrit over centuries, Vyakarana is by design adaptive to its subject. Vyakarana is used in the ontology generation step of LOGS and also in intelligent domain queries such as generating concordances, dictionary lookup of compound words, and dynamic transliterations.
Finally, we comment on how such an ontologized book could be easily integrated into, and cross-referenced in, a searchable digital repository, and on the implications of a successful use of LOGS in Sanskritology for other domains.

Similar resources

Towards Unsupervised Spoken Language Understanding: Exploiting Query Click Logs for Slot Filling

In this paper, we present a novel approach to exploit user queries mined from search engine query click logs to bootstrap or improve slot filling models for spoken language understanding. We propose extending the earlier gazetteer population techniques to mine unannotated training data for semantic parsing. The automatically annotated mined data can then be used to train slot specific parsing m...

A Query Language for Analyzing Business Processes Execution

The execution of a business process (BP) in today’s enterprises may involve a workflow and multiple IT systems and services. Often no complete, up-to-date documentation of the model or correlation information of process events exists. Understanding the execution of a BP in terms of its scope and details is challenging, especially as it is subjective: it depends on the perspective of the person lookin...

Extending Access Management to maintain audit logs in cloud computing

considering the most often talked about security risks in cloud computing, like, security and compliance, viability, lack of transparency, reliability and performance issues. Bringing strong auditability in cloud services can reduce these risks to a great extent. Also, auditing, both internally and externally is generally required and sometimes unavoidable looking into the present day competiti...

Automatic Generation of a Multi Agent System for Crisis Management by a Model Driven Approach

Considering the increasing occurrences of unexpected events and the need for pre-crisis planning in order to reduce risks and losses, modeling instant response environments is needed more than ever. Modeling may lead to more careful planning for crisis-response operations, such as team formation, task assignment, and doing the task by teams. A common challenge in this way is that the model shou...

A programming method to estimate proximate parameters of coal beds from well-logging data using a sequential solving of linear equation systems

This paper presents an innovative solution for estimating the proximate parameters of coal beds from the well-logs. To implement the solution, the C# programming language was used. The data from four exploratory boreholes was used in a case study to express the method and determine its accuracy. Then two boreholes were selected as the reference, namely the boreholes with available well-logging ...

Publication date: 2004